Overview
Brought to you by YData
Dataset statistics
| Number of variables | 12 |
|---|---|
| Number of observations | 418 |
| Missing cells | 414 |
| Missing cells (%) | 8.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 39.3 KiB |
| Average record size in memory | 96.3 B |
Variable types
| Numeric | 6 |
|---|---|
| Categorical | 3 |
| Text | 3 |
PassengerId is highly overall correlated with Unnamed: 0 | High correlation |
Unnamed: 0 is highly overall correlated with PassengerId | High correlation |
Age has 86 (20.6%) missing values | Missing |
Cabin has 327 (78.2%) missing values | Missing |
Unnamed: 0 is uniformly distributed | Uniform |
PassengerId is uniformly distributed | Uniform |
Unnamed: 0 has unique values | Unique |
PassengerId has unique values | Unique |
Name has unique values | Unique |
SibSp has 283 (67.7%) zeros | Zeros |
Parch has 324 (77.5%) zeros | Zeros |
Reproduction
| Analysis started | 2025-06-24 00:22:17.282488 |
|---|---|
| Analysis finished | 2025-06-24 00:22:21.238756 |
| Duration | 3.96 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
Unnamed: 0
Real number (ℝ)
High correlation  Uniform  Unique 
| Distinct | 418 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 208.5 |
| Minimum | 0 |
|---|---|
| Maximum | 417 |
| Zeros | 1 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 20.85 |
| Q1 | 104.25 |
| median | 208.5 |
| Q3 | 312.75 |
| 95-th percentile | 396.15 |
| Maximum | 417 |
| Range | 417 |
| Interquartile range (IQR) | 208.5 |
Descriptive statistics
| Standard deviation | 120.81046 |
|---|---|
| Coefficient of variation (CV) | 0.57942666 |
| Kurtosis | -1.2 |
| Mean | 208.5 |
| Median Absolute Deviation (MAD) | 104.5 |
| Skewness | 0 |
| Sum | 87153 |
| Variance | 14595.167 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 417 | 1 | 0.2% |
| 0 | 1 | 0.2% |
| 401 | 1 | 0.2% |
| 400 | 1 | 0.2% |
| 399 | 1 | 0.2% |
| 398 | 1 | 0.2% |
| 397 | 1 | 0.2% |
| 396 | 1 | 0.2% |
| 395 | 1 | 0.2% |
| 394 | 1 | 0.2% |
| Other values (408) | 408 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 417 | 1 | |
| 416 | 1 | |
| 415 | 1 | |
| 414 | 1 | |
| 413 | 1 | |
| 412 | 1 | |
| 411 | 1 | |
| 410 | 1 | |
| 409 | 1 | |
| 408 | 1 |
PassengerId
Real number (ℝ)
High correlation  Uniform  Unique 
| Distinct | 418 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1100.5 |
| Minimum | 892 |
|---|---|
| Maximum | 1309 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 KiB |
Quantile statistics
| Minimum | 892 |
|---|---|
| 5-th percentile | 912.85 |
| Q1 | 996.25 |
| median | 1100.5 |
| Q3 | 1204.75 |
| 95-th percentile | 1288.15 |
| Maximum | 1309 |
| Range | 417 |
| Interquartile range (IQR) | 208.5 |
Descriptive statistics
| Standard deviation | 120.81046 |
|---|---|
| Coefficient of variation (CV) | 0.10977779 |
| Kurtosis | -1.2 |
| Mean | 1100.5 |
| Median Absolute Deviation (MAD) | 104.5 |
| Skewness | 0 |
| Sum | 460009 |
| Variance | 14595.167 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 1309 | 1 | 0.2% |
| 892 | 1 | 0.2% |
| 1293 | 1 | 0.2% |
| 1292 | 1 | 0.2% |
| 1291 | 1 | 0.2% |
| 1290 | 1 | 0.2% |
| 1289 | 1 | 0.2% |
| 1288 | 1 | 0.2% |
| 1287 | 1 | 0.2% |
| 1286 | 1 | 0.2% |
| Other values (408) | 408 |
| Value | Count | Frequency (%) |
| 892 | 1 | |
| 893 | 1 | |
| 894 | 1 | |
| 895 | 1 | |
| 896 | 1 | |
| 897 | 1 | |
| 898 | 1 | |
| 899 | 1 | |
| 900 | 1 | |
| 901 | 1 |
| Value | Count | Frequency (%) |
| 1309 | 1 | |
| 1308 | 1 | |
| 1307 | 1 | |
| 1306 | 1 | |
| 1305 | 1 | |
| 1304 | 1 | |
| 1303 | 1 | |
| 1302 | 1 | |
| 1301 | 1 | |
| 1300 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 3 |
| 3rd row | 2 |
| 4th row | 3 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 218 | |
| 1 | 107 | |
| 2 | 93 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3 | 218 | |
| 1 | 107 | |
| 2 | 93 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 218 | |
| 1 | 107 | |
| 2 | 93 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 418 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 3 | 218 | |
| 1 | 107 | |
| 2 | 93 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 418 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 3 | 218 | |
| 1 | 107 | |
| 2 | 93 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 418 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 3 | 218 | |
| 1 | 107 | |
| 2 | 93 |
Name
Text
Unique 
| Distinct | 418 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 KiB |
Length
| Max length | 63 |
|---|---|
| Median length | 51 |
| Mean length | 27.483254 |
| Min length | 13 |
Unique
| Unique | 418 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Kelly, Mr. James |
|---|---|
| 2nd row | Wilkes, Mrs. James (Ellen Needs) |
| 3rd row | Myles, Mr. Thomas Francis |
| 4th row | Wirz, Mr. Albert |
| 5th row | Hirvonen, Mrs. Alexander (Helga E Lindqvist) |
| Value | Count | Frequency (%) |
| mr | 242 | 14.0% |
| miss | 78 | 4.5% |
| mrs | 72 | 4.2% |
| john | 28 | 1.6% |
| william | 23 | 1.3% |
| master | 21 | 1.2% |
| charles | 16 | 0.9% |
| joseph | 15 | 0.9% |
| thomas | 14 | 0.8% |
| james | 14 | 0.8% |
| Other values (825) | 1202 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1309 | 11.4% | |
| r | 971 | 8.5% |
| e | 822 | 7.2% |
| a | 786 | 6.8% |
| s | 628 | 5.5% |
| i | 621 | 5.4% |
| n | 596 | 5.2% |
| l | 526 | 4.6% |
| M | 515 | 4.5% |
| o | 467 | 4.1% |
| Other values (48) | 4247 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 11488 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1309 | 11.4% | |
| r | 971 | 8.5% |
| e | 822 | 7.2% |
| a | 786 | 6.8% |
| s | 628 | 5.5% |
| i | 621 | 5.4% |
| n | 596 | 5.2% |
| l | 526 | 4.6% |
| M | 515 | 4.5% |
| o | 467 | 4.1% |
| Other values (48) | 4247 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 11488 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1309 | 11.4% | |
| r | 971 | 8.5% |
| e | 822 | 7.2% |
| a | 786 | 6.8% |
| s | 628 | 5.5% |
| i | 621 | 5.4% |
| n | 596 | 5.2% |
| l | 526 | 4.6% |
| M | 515 | 4.5% |
| o | 467 | 4.1% |
| Other values (48) | 4247 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 11488 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1309 | 11.4% | |
| r | 971 | 8.5% |
| e | 822 | 7.2% |
| a | 786 | 6.8% |
| s | 628 | 5.5% |
| i | 621 | 5.4% |
| n | 596 | 5.2% |
| l | 526 | 4.6% |
| M | 515 | 4.5% |
| o | 467 | 4.1% |
| Other values (48) | 4247 |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.7272727 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | male |
|---|---|
| 2nd row | female |
| 3rd row | male |
| 4th row | male |
| 5th row | female |
Common Values
| Value | Count | Frequency (%) |
| male | 266 | |
| female | 152 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| male | 266 | |
| female | 152 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 570 | |
| m | 418 | |
| a | 418 | |
| l | 418 | |
| f | 152 | 7.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1976 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 570 | |
| m | 418 | |
| a | 418 | |
| l | 418 | |
| f | 152 | 7.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1976 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 570 | |
| m | 418 | |
| a | 418 | |
| l | 418 | |
| f | 152 | 7.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1976 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 570 | |
| m | 418 | |
| a | 418 | |
| l | 418 | |
| f | 152 | 7.7% |
Age
Real number (ℝ)
Missing 
| Distinct | 79 |
|---|---|
| Distinct (%) | 23.8% |
| Missing | 86 |
| Missing (%) | 20.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.27259 |
| Minimum | 0.17 |
|---|---|
| Maximum | 76 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 KiB |
Quantile statistics
| Minimum | 0.17 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 21 |
| median | 27 |
| Q3 | 39 |
| 95-th percentile | 57 |
| Maximum | 76 |
| Range | 75.83 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 14.181209 |
|---|---|
| Coefficient of variation (CV) | 0.46845047 |
| Kurtosis | 0.083783352 |
| Mean | 30.27259 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.45736129 |
| Sum | 10050.5 |
| Variance | 201.1067 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 21 | 17 | 4.1% |
| 24 | 17 | 4.1% |
| 22 | 16 | 3.8% |
| 30 | 15 | 3.6% |
| 18 | 13 | 3.1% |
| 27 | 12 | 2.9% |
| 26 | 12 | 2.9% |
| 23 | 11 | 2.6% |
| 25 | 11 | 2.6% |
| 29 | 10 | 2.4% |
| Other values (69) | 198 | |
| (Missing) | 86 |
| Value | Count | Frequency (%) |
| 0.17 | 1 | 0.2% |
| 0.33 | 1 | 0.2% |
| 0.75 | 1 | 0.2% |
| 0.83 | 1 | 0.2% |
| 0.92 | 1 | 0.2% |
| 1 | 3 | |
| 2 | 2 | |
| 3 | 1 | 0.2% |
| 5 | 1 | 0.2% |
| 6 | 3 |
| Value | Count | Frequency (%) |
| 76 | 1 | 0.2% |
| 67 | 1 | 0.2% |
| 64 | 3 | |
| 63 | 2 | |
| 62 | 1 | 0.2% |
| 61 | 2 | |
| 60.5 | 1 | 0.2% |
| 60 | 3 | |
| 59 | 1 | 0.2% |
| 58 | 1 | 0.2% |
SibSp
Real number (ℝ)
Zeros 
| Distinct | 7 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.44736842 |
| Minimum | 0 |
|---|---|
| Maximum | 8 |
| Zeros | 283 |
| Zeros (%) | 67.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.89675956 |
|---|---|
| Coefficient of variation (CV) | 2.0045214 |
| Kurtosis | 26.498712 |
| Mean | 0.44736842 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.1683366 |
| Sum | 187 |
| Variance | 0.80417771 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 283 | |
| 1 | 110 | 26.3% |
| 2 | 14 | 3.3% |
| 3 | 4 | 1.0% |
| 4 | 4 | 1.0% |
| 8 | 2 | 0.5% |
| 5 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 283 | |
| 1 | 110 | 26.3% |
| 2 | 14 | 3.3% |
| 3 | 4 | 1.0% |
| 4 | 4 | 1.0% |
| 5 | 1 | 0.2% |
| 8 | 2 | 0.5% |
| Value | Count | Frequency (%) |
| 8 | 2 | 0.5% |
| 5 | 1 | 0.2% |
| 4 | 4 | 1.0% |
| 3 | 4 | 1.0% |
| 2 | 14 | 3.3% |
| 1 | 110 | 26.3% |
| 0 | 283 |
Parch
Real number (ℝ)
Zeros 
| Distinct | 8 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3923445 |
| Minimum | 0 |
|---|---|
| Maximum | 9 |
| Zeros | 324 |
| Zeros (%) | 77.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.98142888 |
|---|---|
| Coefficient of variation (CV) | 2.5014468 |
| Kurtosis | 31.412513 |
| Mean | 0.3923445 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.6544617 |
| Sum | 164 |
| Variance | 0.96320264 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 324 | |
| 1 | 52 | 12.4% |
| 2 | 33 | 7.9% |
| 3 | 3 | 0.7% |
| 4 | 2 | 0.5% |
| 9 | 2 | 0.5% |
| 6 | 1 | 0.2% |
| 5 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 324 | |
| 1 | 52 | 12.4% |
| 2 | 33 | 7.9% |
| 3 | 3 | 0.7% |
| 4 | 2 | 0.5% |
| 5 | 1 | 0.2% |
| 6 | 1 | 0.2% |
| 9 | 2 | 0.5% |
| Value | Count | Frequency (%) |
| 9 | 2 | 0.5% |
| 6 | 1 | 0.2% |
| 5 | 1 | 0.2% |
| 4 | 2 | 0.5% |
| 3 | 3 | 0.7% |
| 2 | 33 | 7.9% |
| 1 | 52 | 12.4% |
| 0 | 324 |
Ticket
Text
| Distinct | 363 |
|---|---|
| Distinct (%) | 86.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 KiB |
Length
| Max length | 18 |
|---|---|
| Median length | 17 |
| Mean length | 6.8755981 |
| Min length | 3 |
Unique
| Unique | 321 ? |
|---|---|
| Unique (%) | 76.8% |
Sample
| 1st row | 330911 |
|---|---|
| 2nd row | 363272 |
| 3rd row | 240276 |
| 4th row | 315154 |
| 5th row | 3101298 |
| Value | Count | Frequency (%) |
| pc | 32 | 5.9% |
| c.a | 19 | 3.5% |
| soton/o.q | 8 | 1.5% |
| ca | 8 | 1.5% |
| sc/paris | 7 | 1.3% |
| 17608 | 5 | 0.9% |
| a/5 | 5 | 0.9% |
| 2 | 5 | 0.9% |
| w./c | 5 | 0.9% |
| 2343 | 4 | 0.7% |
| Other values (383) | 445 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 364 | |
| 1 | 311 | |
| 2 | 268 | |
| 7 | 207 | 7.2% |
| 6 | 206 | 7.2% |
| 0 | 204 | 7.1% |
| 5 | 195 | 6.8% |
| 4 | 188 | 6.5% |
| 8 | 144 | 5.0% |
| 9 | 137 | 4.8% |
| Other values (22) | 650 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2874 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 3 | 364 | |
| 1 | 311 | |
| 2 | 268 | |
| 7 | 207 | 7.2% |
| 6 | 206 | 7.2% |
| 0 | 204 | 7.1% |
| 5 | 195 | 6.8% |
| 4 | 188 | 6.5% |
| 8 | 144 | 5.0% |
| 9 | 137 | 4.8% |
| Other values (22) | 650 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2874 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 3 | 364 | |
| 1 | 311 | |
| 2 | 268 | |
| 7 | 207 | 7.2% |
| 6 | 206 | 7.2% |
| 0 | 204 | 7.1% |
| 5 | 195 | 6.8% |
| 4 | 188 | 6.5% |
| 8 | 144 | 5.0% |
| 9 | 137 | 4.8% |
| Other values (22) | 650 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2874 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 3 | 364 | |
| 1 | 311 | |
| 2 | 268 | |
| 7 | 207 | 7.2% |
| 6 | 206 | 7.2% |
| 0 | 204 | 7.1% |
| 5 | 195 | 6.8% |
| 4 | 188 | 6.5% |
| 8 | 144 | 5.0% |
| 9 | 137 | 4.8% |
| Other values (22) | 650 |
Fare
Real number (ℝ)
| Distinct | 169 |
|---|---|
| Distinct (%) | 40.5% |
| Missing | 1 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35.627188 |
| Minimum | 0 |
|---|---|
| Maximum | 512.3292 |
| Zeros | 2 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 7.2292 |
| Q1 | 7.8958 |
| median | 14.4542 |
| Q3 | 31.5 |
| 95-th percentile | 151.55 |
| Maximum | 512.3292 |
| Range | 512.3292 |
| Interquartile range (IQR) | 23.6042 |
Descriptive statistics
| Standard deviation | 55.907576 |
|---|---|
| Coefficient of variation (CV) | 1.5692391 |
| Kurtosis | 17.921595 |
| Mean | 35.627188 |
| Median Absolute Deviation (MAD) | 6.825 |
| Skewness | 3.6872133 |
| Sum | 14856.538 |
| Variance | 3125.6571 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7.75 | 21 | 5.0% |
| 26 | 19 | 4.5% |
| 13 | 17 | 4.1% |
| 8.05 | 17 | 4.1% |
| 7.8958 | 11 | 2.6% |
| 10.5 | 11 | 2.6% |
| 7.775 | 10 | 2.4% |
| 7.225 | 9 | 2.2% |
| 7.2292 | 9 | 2.2% |
| 8.6625 | 8 | 1.9% |
| Other values (159) | 285 |
| Value | Count | Frequency (%) |
| 0 | 2 | 0.5% |
| 3.1708 | 1 | 0.2% |
| 6.4375 | 2 | 0.5% |
| 6.4958 | 1 | 0.2% |
| 6.95 | 1 | 0.2% |
| 7 | 2 | 0.5% |
| 7.05 | 2 | 0.5% |
| 7.225 | 9 | |
| 7.2292 | 9 | |
| 7.25 | 5 |
| Value | Count | Frequency (%) |
| 512.3292 | 1 | 0.2% |
| 263 | 2 | 0.5% |
| 262.375 | 5 | |
| 247.5208 | 1 | 0.2% |
| 227.525 | 1 | 0.2% |
| 221.7792 | 3 | |
| 211.5 | 4 | |
| 211.3375 | 1 | 0.2% |
| 164.8667 | 2 | 0.5% |
| 151.55 | 2 | 0.5% |
Cabin
Text
Missing 
| Distinct | 76 |
|---|---|
| Distinct (%) | 83.5% |
| Missing | 327 |
| Missing (%) | 78.2% |
| Memory size | 3.4 KiB |
Length
| Max length | 15 |
|---|---|
| Median length | 3 |
| Mean length | 4.0769231 |
| Min length | 1 |
Unique
| Unique | 62 ? |
|---|---|
| Unique (%) | 68.1% |
Sample
| 1st row | B45 |
|---|---|
| 2nd row | E31 |
| 3rd row | B57 B59 B63 B66 |
| 4th row | B36 |
| 5th row | A21 |
| Value | Count | Frequency (%) |
| f | 4 | 3.4% |
| b57 | 3 | 2.5% |
| b63 | 3 | 2.5% |
| b66 | 3 | 2.5% |
| b59 | 3 | 2.5% |
| b45 | 2 | 1.7% |
| c25 | 2 | 1.7% |
| c27 | 2 | 1.7% |
| c78 | 2 | 1.7% |
| c23 | 2 | 1.7% |
| Other values (80) | 92 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 43 | |
| 5 | 34 | |
| 1 | 33 | 8.9% |
| B | 32 | 8.6% |
| 6 | 30 | 8.1% |
| 3 | 28 | 7.5% |
| 27 | 7.3% | |
| 2 | 25 | 6.7% |
| 4 | 21 | 5.7% |
| 7 | 15 | 4.0% |
| Other values (8) | 83 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 371 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| C | 43 | |
| 5 | 34 | |
| 1 | 33 | 8.9% |
| B | 32 | 8.6% |
| 6 | 30 | 8.1% |
| 3 | 28 | 7.5% |
| 27 | 7.3% | |
| 2 | 25 | 6.7% |
| 4 | 21 | 5.7% |
| 7 | 15 | 4.0% |
| Other values (8) | 83 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 371 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| C | 43 | |
| 5 | 34 | |
| 1 | 33 | 8.9% |
| B | 32 | 8.6% |
| 6 | 30 | 8.1% |
| 3 | 28 | 7.5% |
| 27 | 7.3% | |
| 2 | 25 | 6.7% |
| 4 | 21 | 5.7% |
| 7 | 15 | 4.0% |
| Other values (8) | 83 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 371 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| C | 43 | |
| 5 | 34 | |
| 1 | 33 | 8.9% |
| B | 32 | 8.6% |
| 6 | 30 | 8.1% |
| 3 | 28 | 7.5% |
| 27 | 7.3% | |
| 2 | 25 | 6.7% |
| 4 | 21 | 5.7% |
| 7 | 15 | 4.0% |
| Other values (8) | 83 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Q |
|---|---|
| 2nd row | S |
| 3rd row | Q |
| 4th row | S |
| 5th row | S |
Common Values
| Value | Count | Frequency (%) |
| S | 270 | |
| C | 102 | 24.4% |
| Q | 46 | 11.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| s | 270 | |
| c | 102 | 24.4% |
| q | 46 | 11.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 270 | |
| C | 102 | 24.4% |
| Q | 46 | 11.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 418 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| S | 270 | |
| C | 102 | 24.4% |
| Q | 46 | 11.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 418 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| S | 270 | |
| C | 102 | 24.4% |
| Q | 46 | 11.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 418 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| S | 270 | |
| C | 102 | 24.4% |
| Q | 46 | 11.0% |
Interactions
Correlations
| Age | Embarked | Fare | Parch | PassengerId | Pclass | Sex | SibSp | Unnamed: 0 | |
|---|---|---|---|---|---|---|---|---|---|
| Age | 1.000 | 0.135 | 0.315 | -0.130 | -0.019 | 0.349 | 0.000 | -0.015 | -0.019 |
| Embarked | 0.135 | 1.000 | 0.240 | 0.113 | 0.060 | 0.308 | 0.109 | 0.101 | 0.060 |
| Fare | 0.315 | 0.240 | 1.000 | 0.378 | 0.020 | 0.475 | 0.154 | 0.441 | 0.020 |
| Parch | -0.130 | 0.113 | 0.378 | 1.000 | 0.051 | 0.000 | 0.213 | 0.412 | 0.051 |
| PassengerId | -0.019 | 0.060 | 0.020 | 0.051 | 1.000 | 0.054 | 0.000 | -0.010 | 1.000 |
| Pclass | 0.349 | 0.308 | 0.475 | 0.000 | 0.054 | 1.000 | 0.106 | 0.113 | 0.054 |
| Sex | 0.000 | 0.109 | 0.154 | 0.213 | 0.000 | 0.106 | 1.000 | 0.136 | 0.000 |
| SibSp | -0.015 | 0.101 | 0.441 | 0.412 | -0.010 | 0.113 | 0.136 | 1.000 | -0.010 |
| Unnamed: 0 | -0.019 | 0.060 | 0.020 | 0.051 | 1.000 | 0.054 | 0.000 | -0.010 | 1.000 |
Missing values
Sample
| Unnamed: 0 | PassengerId | Pclass | Name | Sex | Age | SibSp | Parch | Ticket | Fare | Cabin | Embarked | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 892 | 3 | Kelly, Mr. James | male | 34.5 | 0 | 0 | 330911 | 7.8292 | NaN | Q |
| 1 | 1 | 893 | 3 | Wilkes, Mrs. James (Ellen Needs) | female | 47.0 | 1 | 0 | 363272 | 7.0000 | NaN | S |
| 2 | 2 | 894 | 2 | Myles, Mr. Thomas Francis | male | 62.0 | 0 | 0 | 240276 | 9.6875 | NaN | Q |
| 3 | 3 | 895 | 3 | Wirz, Mr. Albert | male | 27.0 | 0 | 0 | 315154 | 8.6625 | NaN | S |
| 4 | 4 | 896 | 3 | Hirvonen, Mrs. Alexander (Helga E Lindqvist) | female | 22.0 | 1 | 1 | 3101298 | 12.2875 | NaN | S |
| 5 | 5 | 897 | 3 | Svensson, Mr. Johan Cervin | male | 14.0 | 0 | 0 | 7538 | 9.2250 | NaN | S |
| 6 | 6 | 898 | 3 | Connolly, Miss. Kate | female | 30.0 | 0 | 0 | 330972 | 7.6292 | NaN | Q |
| 7 | 7 | 899 | 2 | Caldwell, Mr. Albert Francis | male | 26.0 | 1 | 1 | 248738 | 29.0000 | NaN | S |
| 8 | 8 | 900 | 3 | Abrahim, Mrs. Joseph (Sophie Halaut Easu) | female | 18.0 | 0 | 0 | 2657 | 7.2292 | NaN | C |
| 9 | 9 | 901 | 3 | Davies, Mr. John Samuel | male | 21.0 | 2 | 0 | A/4 48871 | 24.1500 | NaN | S |
| Unnamed: 0 | PassengerId | Pclass | Name | Sex | Age | SibSp | Parch | Ticket | Fare | Cabin | Embarked | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 408 | 408 | 1300 | 3 | Riordan, Miss. Johanna Hannah"" | female | NaN | 0 | 0 | 334915 | 7.7208 | NaN | Q |
| 409 | 409 | 1301 | 3 | Peacock, Miss. Treasteall | female | 3.0 | 1 | 1 | SOTON/O.Q. 3101315 | 13.7750 | NaN | S |
| 410 | 410 | 1302 | 3 | Naughton, Miss. Hannah | female | NaN | 0 | 0 | 365237 | 7.7500 | NaN | Q |
| 411 | 411 | 1303 | 1 | Minahan, Mrs. William Edward (Lillian E Thorpe) | female | 37.0 | 1 | 0 | 19928 | 90.0000 | C78 | Q |
| 412 | 412 | 1304 | 3 | Henriksson, Miss. Jenny Lovisa | female | 28.0 | 0 | 0 | 347086 | 7.7750 | NaN | S |
| 413 | 413 | 1305 | 3 | Spector, Mr. Woolf | male | NaN | 0 | 0 | A.5. 3236 | 8.0500 | NaN | S |
| 414 | 414 | 1306 | 1 | Oliva y Ocana, Dona. Fermina | female | 39.0 | 0 | 0 | PC 17758 | 108.9000 | C105 | C |
| 415 | 415 | 1307 | 3 | Saether, Mr. Simon Sivertsen | male | 38.5 | 0 | 0 | SOTON/O.Q. 3101262 | 7.2500 | NaN | S |
| 416 | 416 | 1308 | 3 | Ware, Mr. Frederick | male | NaN | 0 | 0 | 359309 | 8.0500 | NaN | S |
| 417 | 417 | 1309 | 3 | Peter, Master. Michael J | male | NaN | 1 | 1 | 2668 | 22.3583 | NaN | C |